SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Horvitz, Robert 
Yuan, Junying 
5 Shaham, Shai 

(ii) TITLE OF THE INVENTION: RELATEDNESS OF HUMAN INTERLEUKIN- 

BETA CONVERTASE GENE TO A C. ELEGANS CELL DEATH 
GENE, INHIBITORY PORTIONS OF THESE GENES AND... 

(iii) NUMBER OF SEQUENCES: 27 

10 (iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Clark & Elbing LLP 

(B) STREET: 176 Federal Street 

(C) CITY: Boston 

(D) STATE: MA 
15 (E) COUNTRY: USA 

(F) ZIP: 02110 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 
20 (C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 21-MAY-1995 
25 (C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/394,189 

(B) FILING DATE: 24 -FEB -1995 

(A) APPLICATION NUMBER: 08/282,211 
30 (B) FILING DATE: 12-JUL-1994 

(A) APPLICATION NUMBER: 07/984,182 

(B) FILING DATE: 20-NOV-1992 

(A) APPLICATION NUMBER: 07/897,788 

(B) FILING DATE: 12-JUN-1992 

35 (viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Bieker-Brady , Kristina 

(B) REGISTRATION NUMBER: 39,109 
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(C) REFERENCE /DOCKET NUMBER: 01997/211002 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-428-0200 

(B) TELEFAX: 617-428-7045 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7653 base pairs 

(B) TYPE: nucleic acid 
10 <C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

AGATCTGAAA TAAGGTGATA AATTAATAAA TTAAGTGTAT TTCTGAGGAA ATTTGACTGT 60 

15 TTTAGCACAA TTAATCTTGT TTCAGAAAAA AAGTCCAGTT TTCTAGATTT TTCCGTCTTA 120 

TTGTCGAATT AATATCCCTA TTATCACTTT TTCATGCTCA TCCTCGAGCG GCACGTCCTC 180 

AAAGAATTGT GAGAGCAAAC GCGCTCCCAT TGACCTCCAC ACTCAGCCGC CAAAACAAAC 240 

GTTCGAACAT TCGTGTGTTG TGCTCCTTTT CCGTTATCTT GCAGTCATCT TTTGTCGTTT 300 

TTTTCTTTGT TCTTTTTGTT GAACGTGTTG CTAAGCAATT ATTACATCAA TTGAAGAAAA 360 

20 GGCTCGCCGA TTTATTGTTG CCAGAAAGAT TCTGAGATTC TCGAAGTCGA TTTTATAATA 420 

TTTAACCTTG GTTTTTGCAT TGTTTCGTTT AAAAAAACCA CTGTTTATGT GAAAAACGAT 480 

TAGTTTACTA ATAAAACTAC TTTTAAACCT TTACCTTTAC CTCACCGCTC CGTGTTCATG 540 

GCTCATAGAT TTTCGATACT CAAATCCAAA AATAAATTTA CGAGGGCAAT TAATGTGAAA 600 

CAAAAACAAT CCTAAGATTT CCACATGTTT GACCTCTCCG GCACCTTCTT CCTTAGCCCC 660 

2 5 ACCACTCCAT CACCTCTTTG GCGGTGTTCT TCGAAACCCA CTTAGGAAAG CAGTGTGTAT 720 

CTCATTTGGT ATGCTCTTTT CGATTTTATA GCTCTTTGTC GCAATTTCAA TGCTTTAAAC 780 

AATCCAAATC GCATTATATT TGTGCATGGA GGCAAATGAC GGGGTTGGAA TCTTAGATGA 840 

GATCAGGAGC TTTCAGGGTA AACGCCCGGT TCATTTTGTA CCACATTTCA TCATTTTCCT 900 

GTCGTCCTTG GTATCCTCAA CTTGTCCCGG TTTTGTTTTC GGTACACTCT TCCGTGATGC 960 

30 CACCTGTCTC CGTCTCAATT ATCGTTTAGA AATGTGAACT GTCCAGATGG GTGACTCATA 1020 

TTGCTGCTGC TACAATCCAC TTTCTTTTCT CATCGGCAGT CTTACGAGCC CATCATAAAC 1080 

TTTTTTTTCC GCGAAATTTG CAATAAACCG GCCAAAAACT TTCTCCAAAT TGTTACGCAA 1140 

TATATACAAT CCATAAGAAT ATCTTCTCAA TGTTTATGAT TTCTTCGCAG CACTTTCTCT 12 00 

TCGTGTGCTA ACATCTTATT TTTATAATAT TTCCGCTAAA ATTCCGATTT TTGAGTATTA 1260 

35 ATTTATCGTA AAATTATCAT AATAGCACCG AAAACTACTA AAAATGGTAA AAGCTCCTTT 1320 

TAAATCGGCT CGACATTATC GTATTAAGGA ATCACAAAAT TCTGAGAATG CGTACTGCGC 1380 

AACATATTTG ACGGCAAAAT ATCTCGTAGC GAAAACTACA GTAATTCTTT AAATGACTAC 1440 

TGTAGCGCTT GTGTCGATTT ACGGGCTCAA TTTTTGAAAA TAATTTTTTT TTTCGAATTT 1500 

TGATAACCCG TAAATCGTCA CAACGCTACA GTAGTCATTT AAAGGATTAC TGTAGTTCTA 1560 

4 0 GCTACGAGAT ATTTTGCGCG CCAAATATGA CTGTAATACG CATTCTCTGA ATTTTGTGTT 162 0 

TCCGTAATAA TTTCACAAGA TTTTGGCATT CCACTTTAAA GGCGCACAGG ATTTATTCCA 1680 

ATGGGTCTCG GCACGCAAAA AGTTTGATAG ACTTTTAAAT TCTCCTTGCA TTTTTAATTC 1740 

AATTACTAAA ATTTTCGTGA ATTTTTCTGT TAAAATTTTT AAAATCAGTT TTCTAATATT 18 00 

TTCCAGGCTG ACAAACAGAA ACAAAAACAC AACAAACATT TTAAAAATCA GTTTTCAAAT 1860 
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TAAAAATAAC 






GATTCAAAAC 






TTCAAGAGGG 






GCAGTTGTTG 




5 


AGCTGAATTT 






TTTCAGACTA 






ATCAGCCGAA 






TCTCTAGTCA 






ATAGTGATAA 




10 


AATTGATAAT 






AGATCGTGAA 






TTCGCTCTAC 






TTTTTAAAGT 






GCAACCCACC 




15 


TTGAAGTGAA 






ATTTTCGTGG 






ACACACATCT 






GTGTCCAATG 






TTCACCGACC 




20 


GGATATCTAC 






ACACAATTAT 


W. 




AACACTAAAT 


ffi 




GAAAAATACA 


ffi 




AAACGAATAT 


fit 


25 


TTGATATTTT 






CTTTTAAAGA 






TGAAAAAAAA 


r* 




TTTTTTTCAT 






TAGAAATTTT 




30 


TCATGTGGTT 






TCATGTTGTG 






ACGACAGCTC 






GCGTCTCTCG 






ATGTAAAAAA 




35 


ATTAAAATAC 






GAGCAACTTA 






TTTATTGATA 






GCGAAACACC 






TTTTTGCATT 




40 


GAGGATATAA 






AATTTGGGTA 






TCACCGGATG 






CTGGACCAAC 






TAAGCCGTGT 




45 


GCCTCATCAT 






ACAAGGACAA 






ATCTGACGGG 






CCGAAAATGT 






TTTGCAAAAA 




50 


GTTTTTTCCG 






ACCGTTTGTA 




GATTTCTCAT TGAAAATTGT 
AATTTTAACA AAAAAAAACC 
TCAAAATTTT CCGATTTTAC 
GAGTTTTTGA CGAAAACTAG 
TGAGGACAAT GTTTAAAAAA 
AATCGAAAAT CAAATCGTAC 
GATGATGCGT CAAGATAGAA 
TCTAAAAGTC GATGAAATTY 
TGGAGATATG ATTAATGTGA 
ATAAAGAATA TTTTTGCAGT 
AGCAGTGCAA CGACGGRGAG 
GGGACACGAA GGACTTGCTG 
TCGGCGCAAA AGCAAGGGTC 
GGCACGGTTT TTTCCTCCGA 
ATATATTTTA TTTACTGAAA 
CGCAAAAGGC CATTTTGTAG 
CCTTCAAATA TCCCTTTTTC 
TCACCGGCAA GCCATCGTCG 
CGAGTTCACC GTGACAGCGT 
TCAAGAGCAA GATCTCGTTC 
TCATCTCCTC CAGTCAACGC 
TCTGAGAATG CGCATTACTC 
GTAACCCTTT AAATGACTAT 
ATGCTCGAAT TGTGACAACG 
TGATCAATTA ATAAATTATT 
GTTACAGTAG TTTTCGCTTC 
TCATCTAACA TGTGCCAAAA 
TCAAGATATG CTTATTAACA 
GGGCTTTTCG TTCTAGTATG 
TGTTCATATG AATGACGAAA 
CAGAAAAATA GTAAAAAAGC 
ACTTCACATG CTGAAGACGA 
TCTTCAGCAT GTGAAATGGG 
TGCATGCGTT TTTTTACACT 
ATTTTTTGTA TTTTTCAACA 
AAAAGTAGAG AATATTAGAG 
ATTATAGATG TTAATAAGCA 
TGAAAAAAAT CAAAAATTCT 
TTTCTACATC ACATGAATGT 
TTGAATGAAA CATTGCGAAA 
TCAAAATCGA TCCTAAAACC 
CTCTTCTCTC GGATACAGTT 
TCAATACATA TTCCATGAAG 
TTTCGACGAG AAAACCATGT 
AAATAATGAA CACTTTGAGC 
TCTTACCAAT TTGTTCAGAT 
AAGGGTACGG CGAAATTATA 
GGCGCCCGGT CTCGACACGA 
ACAAAATTTT GAACTTCCGC 
GCTACATTAT GTGTTTTTTC 
AATTTTCAGA CAATTTTCCG 




GTTTTATGTT TGCGAAAATA 
CCAAAATTCG CCAGAAATCA 
TGACTTTCAC CTTTTTTTTC 
GAAAAAAATC GATAAAAATT 
AAACACTATT TTTCCAATAA 
TCTGACTACG GGTCAGTAGA 
GGAGCTTGCT AGAGAGGAAC 
TCGAAGTTCT CATCGCAAAA 
GTTTTTAATC GAATAATAAT 
CATGTGGAAC GGTTCGCGAG 
ATGTGGCGTT CGACGCGTTT 
AAGTTCTTGA ACCTCTCGCC 
TCACGGAAAA AAGAGGCGGA 
AAATCGGAAA TTATGCACTT 
GCTCGAGTGA TTATTTATTT 
ATTTGCCGAA AATACTTGTC 
CAGTGTTGAC TCGAATGCTG 
GAGCCGCGCA TTGAGCCCCG 
CTCTTCAGTG TCATCATTCA 
TCGATCGCGT GCACTTCATT 
ATTTCCCAGC CAACCTTGTA 
AACATATTTG ACGCGCAAAT 
TGTAGTGTCG ATTTACGGGC 
AATTTTAATT TGTCATTTTT 
TCCGTAAACA GACACCAGCG 
AAGATATTTT GAAAAGAATT 
CGCTTTTTTC AAGTTTCGCA 
CATATAATTA TCATTAATGT 
CTCTACTTTT GAAATTGCTC 
AATAGCAATT TTTTATATAT 
GCATGCATTT TTCGACATTT 
GAGACGCGGA GAAATACCAC 
ATCTCGGTCG ATGTAAAAAA 
TTTCTGCACA AATGAATAGG 
TCACATGATT AACCCCATTA 
CGAAAACCAA AATTTCTTCA 
TATCTTGAAT GAAAGTCAGC 
GCGAAAATTG AAAAAATGCA 
AGAAAATTAA AAGGGAAATC 
TTAAAATGTG CGAAACGTCA 
AACACATTTC AGCATCCGCC 
CAAGTCGTAA TCGCTCATTC 
AGGATATGAA CTTTGTCGAT 
ACAGAAACTT CTCGAGTCCT 
AGATGCCAAC ACGGAATGGT 
GCATGGGCTA TACGGTTATT 
TTACCCAAAC GCGAAATTTG 
CAATTTGTGT TAAATGCAAA 
GAAAATGATT TACCTAGTTT 
TTAGTTTTTC TATAATATTT 
CATACAAAAC TTGATAGCAC 



AAAGAGAACT 


1920 


AGATAAAAAA 


1980 


GTAGTTCAGT 


2040 


ACTCAAATCG 


2100 


TTTCACTCAT 


2160 


GAGGTCAACC 


2220 


ATTATGATGT 


2280 


CAAGTGTTGA 


2340 


TTTAAAAAAA 


2400 


AAGAGACGGG 


2460 


TATGATGCTC 


2520 


AGATCGTAGG 


2580 


TCGTAATTTT 


2640 


TCCCAAATAT 


2700 


TTTAACACTA 


2760 


ACACACACAC 


2820 


TCGAATTCGA 


2880 


CCGGCTACAC 


2940 


CTTCTTATCA 


3000 


CATCGGATCG 


3060 


TGTTGATGCG 


3120 


ATCTCGTAGC 


3180 


TCGATTTTCG 


3240 


GTGTTTTCTT 


3300 


CTACAGTACT 


3360 


TTAAACATTT 


3420 


GATTTTTTGA 


3480 


GAATTTCTTG 


3540 


AACGAAAAAA 


3600 


TTTCCCCTAT 


3660 


TTTACATCGA 


3720 


ACATCTTTCT 


3780 


ATGTCGAATA 


3840 


GGGAAAATGT 


3900 


TTTTTTCGTT 


3960 


AGATATTACC 


4020 


AAAAATATGT 


4080 


TTAAAATACA 


4140 


AAAATTTCTA 


4200 


AAAAAGAGGA 


4260 


AACTCTTCAT 


4320 


AGCAAAGCTT 


4380 


GCACCAACCA 


4440 


CGTGGAATGT 


4500 


ACCAAGGCCG 


4560 


TGCAAGGACA 


4620 


CCATTTTGCG 


4680 


AATGTATAAT 


4740 


CGAAATTTTC 


4800 


GATGTAAAAA 


4860 


GAAATCAATT 


4920 
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TTCTGAATTT TCAAAATTAT CCAAAAATGC ACAATTTAAA ATTTGTGAAA ATTGGCAAAC 4 980 

GGTGTTTCAA TATGAAATGT ATTTTTAAAA ACTTTAAAAA CCACTCCGGA AAAGCAATAA 5040 

AAATCAAAAC AACGTCACAA TTCAAATTCA AAAGTTATTC ATCCGATTTG TTTATTTTTG 5100 

CAAAATTTGA AAAAATCATG AAGGATTTAG AAAAGTTTTA TAACATTTTT TCTAGATTTT 5160 

5 TCAAAATTTT TTTTAACAAA TCGAGAAAAA GAGAATGAAA AATCGATTTT AAAAATATCC 5220 

ACAGCTTCGA GAGTTTGAAA TTACAGTACT CCTTAAAGGC GCACACCCCA TTTGCATTGG 52 80 

ACCAAAAATT TGTCGTGTCG AGACCAGGTA CCGTAGTTTT TGTCGCAAAA ATTGCACCAT 5340 

TGGACAATAA ACCTTCCTAA TCAC CAAAAA GTAAAATTGA AATCTTCGAA AAGCCAAAAA 54 00 

ATTCAAAAAA AAAGTCGAAT TTCGATTTTT TTTTTGGTTT TTTGGTCCCA AAAACCAAAA 5460 

10 AAATCAATTT TCTGCAAAAT ACCAAAAAGA AACCCGAAAA AATTTCCCAG CCTTGTTCCT 5520 

AATGTAAACT GATATTTAAT TTCCAGGGAA TGCTCCTGAC AATTCGAGAC TTTGCCAAAC 5580 

ACGAATCACA CGGAGATTCT GCGATACTCG TGATTCTATC ACACGGAGAA GAGAATGTGA 5640 

TTATTGGAGT TGATGATATA CCGATTAGTA CACACGAGAT ATATGATCTT CTCAACGCGG 5700 

CAAATGCTCC CCGTCTGGCG AATAAGCCGA AAATCGTTTT TGTGCAGGCT TGTCGARGCG 5760 

15 GTTCGTTTTT TATTTTAATT TTAATATAAA TATTTTAAAT AAATTCATTT TCAGAACGTC 5820 

GTGACAATGG ATTCCCAGTC TTGGATTCTG TCGACGGAGT TCCTGCATTT CTTCGTCGTG 5880 

GATGGGACAA TCGAGACGGG CCATTGTTCA ATTTTCTTGG ATGTGTGCGG CCGCAAGTTY 5940 

AGGTTGCAAT TTAATTTCTT GAATGAGAAT ATTCCTTCAA AAAATCTAAA ATAGATTTTT 6000 

ATTCCAGAAA GTCCCGATCG AAAAATTGCG ATATAATTAC GAAATTTGTG ATAAAATGAC 6060 

2 0 AAAC CAATCA GCATCGTCGA TCTCCGCCCA CTTCATCGGA TTGGTTTGAA AGTGGGCGGA 6120 

f} GTGAATTGCT GATTGGTCGC AGTTTTCAGT TTAGAGGGAA TTTAAAAATC GCCTTTTCGA 6180 

J3 AAATTAAAAA TTGATTTTTT CAATTTTTTC GAAAAATATT CCGATTATTT TATATTCTTT 624 0 

m GGAGCGAAAG CCCCGTCCTG TAAACATTTT TAAATGATAA TTAATAAATT TTTGCARCAA 6300 

m GTGTGGAGAA AGAAGCCGAG CYAAGCTGAC ATTCTGATTC GRTACGCAAC GACAGCTCAA 6360 

| 25 TATGTTTCGT GGAGAAACAG TGCTCGTGGA TCATGGTTCA TTCAAGCCGT CTGTGAAGTG 6420 

f{l TTCTCGACAC ACGYAAAGGA TATGGATGTT GTTGAGCTGC TGACTGAAGT CAATAAGAAG 64 80 

"J* GTCGYTTGTG GATTTCAGAC ATCACAGGGA TCGAATATTT TGAAACAGAT GCCARAGGTA 6540 

yj CTTGAAACAA ACAATGCATG TCTAACTTTT AAGGACACAG AAAAATAGGC AGAGGCTCCT 6600 

7 TTTGCAAGCC TGCCGCGCGT CAACCTAGAA TTTTAGTTTT TAGCTAAAAT GATTGATTTT 6660 

hk 30 GAATATTTTA TGCTAATTTT TTTGCGTTAA ATTTTGAAAT AGTCACTATT TATCGGGTTT 6720 

m CCAGTAAAAA ATGTTTATTA GCCATTGGAT TTTACTGAAA ACGAAAATTT GTAGTTTTTC 6780 

55 AACGAAATTT ATCGATTTTT AAATGTAAAA AAAAATAGCG AAAATTACAT CAACCATCAA 684 0 

GCATTTAAGC CAAAATTGTT AACTCATTTA AAAATTAATT CAAAGTTGTC CACGAGTATT 6900 

ACACGGTTGG CGCGCGGCAA GTTTGCAAAA CGACGCTCCG CCTCTTTTTC TGTGCGGCTT 6960 

h? 3 5 GAAAACAAGG GATCGGTTTA GATTTTTCCC CAAAATTTAA ATTAAATTTC AGATGACATY 7020 

^ CCGCCTGCTC AAAAAGTTCT ACTTTTGGCC GGAAGCACGA AACTCTGCCG TCTAAAATTC 70 80 

ACTCGTGATT CATTGCCCAA TTGATAATTG TCTGTATCTT CTCCCCCAGT TCTCTTTCGC 7140 

CCAATTAGTT TAAAACCATG TGTATATTGT TATCCTATAC TCATTTCACT TTATCATTCT 72 00 

ATCATTTCTC TTCCCATTTT CACACATTTC CATTTCTCTA CGATAATCTA AAATTATGAC 7260 

40 GTTTGTGTCT CGAACGCATA ATAATTTTAA TAACTCGTTT TGAATTTGAT TAGTTGTTGT 7320 

GCCCAGTATA TATGTATGTA CTATGCTTCT ATCAACAAAA TAGTTTCATA GATCATCACC 73 80 

CCAACCCCAC CAACCTACCG TACCATATTC ATTTTTGCCG GGAATCAATT TCGATTAATT 7440 

TTAACCTATT TTTTCGCCAC AAAAAATCTA ATATTTGAAT TAACGAATAG CATTCCCATC 75 00 

TCTCCCGTGC CGGAATGCCT CCCGGCCTTT TAAAGTTCGG AACATTTGGC AATTATGTAT 7560 

45 AAATTTGTAG GTCCCCCCCA TCATTTCCCG CCCATCATST CAAATTGCAT TCTTTTTTCG 762 0 

CCGTGATATC CCGATTCTGG TCAGCAAAGA TCT 7653 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 03 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
5 (ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Met Arg Gin Asp Arg Arg Ser Leu Leu Glu Arg Asn lie Met Met 

15 10 15 

Phe Ser Ser His Leu Lys Val Asp Glu lie Xaa Glu Val Leu lie Ala 
10 20 25 30 

Lys Gin Val Leu Asn Ser Asp Asn Gly Asp Met lie Asn Ser Cys Gly 

35 40 45 

Thr Val Arg Glu Lys Arg Arg Glu lie Val Lys Ala Val Gin Arg Arg 
50 55 60 

15 Xaa Asp Val Ala Phe Asp Ala Phe Tyr Asp Ala Leu Arg Ser Thr Gly 
65 70 75 80 

His Glu Gly Leu Ala Glu Val Leu Glu Pro Leu* Ala Arg Ser Val Asp 
85 90 95 

p Ser Asn Ala Val Glu Phe Glu Cys Pro Met Ser Pro Ala Ser His Arg 

B 20 100 105 110 

Jf Arg Ser Arg Ala Leu Ser Pro Ala Gly Tyr Thr Ser Pro Thr Arg Val 

U 115 120 125 

His Arg Asp Ser Val Ser Ser Val Ser Ser Phe Thr Ser Tyr Gin Asp 
13 0 13 5 14 0 

g 2 5 lie Tyr Ser Arg Ala Arg Ser Arg Ser Arg Ser Arg Ala Leu His Ser 
lj 145 150 155 160 

Ser Asp Arg His Asn Tyr Ser Ser Pro Pro Val Asn Ala Phe Pro Ser 
5| 165 170 175 

f§ Gin Pro Ser Ser Ala Asn Ser Ser Phe Thr Gly Cys Ser Ser Leu Gly 

1| 30 180 185 190 

=*! Tyr Ser Ser Ser Arg Asn Arg Ser Phe Ser Lys Ala Ser Gly Pro Thr 

^ 195 200 205 

^ Gin Tyr lie Phe His Glu Glu Asp Met Asn Phe Val Asp Ala Pro Thr 

210 215 220 

35 lie Ser Arg Val Phe Asp Glu Lys Thr Met Tyr Arg Asn Phe Ser Ser 
225 230 235 240 

Pro Arg Gly Met Cys Leu lie lie Asn Asn Glu His Phe Glu Gin Met 

245 250 255 

Pro Thr Arg Asn Gly Thr Lys Ala Asp Lys Asp Asn Leu Thr Asn Leu 
40 260 265 270 

Phe Arg Cys Met Gly Tyr Thr Val lie Cys Lys Asp Asn Leu Thr Gly 

275 280 * 285 

Arg Gly Met Leu Leu Thr lie Arg Asp Phe Ala Lys His Glu Ser His 
290 295 300 

45 Gly Asp Ser Ala lie Leu Val lie Leu Ser His Gly Glu Glu Asn Val 
305 310 315 320 

lie lie Gly Val Asp Asp lie Pro lie Ser Thr His Glu lie Tyr Asp 
325 330 335 



=33 5 
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Leu Leu Asn Ala Ala Asn Ala Pro Arg Leu Ala Asn Lys Pro Lys lie 

340 345 350 

Val Phe Val Gin Ala Cys Arg Xaa Glu Arg Arg Asp Asn Gly Phe Pro 
355 360 365 

5 Val Leu Asp Ser Val Asp Gly Val Pro Ala Phe Leu Arg Arg Gly Trp 
370 375 380 

Asp Asn Arg Asp Gly Pro Leu Phe Asn Phe Leu Gly Cys Val Arg Pro 
385 390 395 400 

Gin Val Xaa Gin Val Trp Arg Lys Lys Pro Ser Xaa Ala Asp lie Leu 
10 405 410 415 

lie Arg Tyr Ala Thr Thr Ala Gin Tyr Val Ser Xaa Arg Asn Ser Ala 

420 425 430 

Arg Gly Ser Trp Phe lie Gin Ala Val Cys Glu Val Phe Ser Thr His 
435 440 445 

15 Xaa Lys Asp Met Asp Val Val Glu Leu Leu Thr Glu Val Asn Lys Lys 
450 455 460 

Val Xaa Cys Gly Phe Gin Thr Ser Gin Gly Ser Asn lie Leu Lys Gin 
465 470 475 480 

Met Pro Xaa Met Thr Xaa Arg Leu Leu Lys Lys Phe Tyr Phe Trp Pro 
20 485 490 495 

Glu Ala Arg Asn Ser Ala Val 
500 

(2) INFORMATION FOR SEQ ID NO:3: 



(i) SEQUENCE CHARACTERISTICS: 
U 25 (A) LENGTH: 1373 base pairs 

=P* (B) TYPE: nucleic acid 

jj (C) STRANDEDNESS : single 

s (D) TOPOLOGY: linear 

31 (ii) MOLECULE TYPE: cDNA 

Tj 3 0 (ix) FEATURE: 

p (A) NAME /KEY: Coding Sequence 

-II (B) LOCATION: 18... 1229 

(D) OTHER INFORMATION: /product= "human interleukin-1 beta 
convertase" 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

' AAAAGGAGAG AAAAGCC ATG GCC GAC AAG GTC CTG AAG GAG AAG AGA AAG 50 

Met Ala Asp Lys Val Leu Lys Glu Lys Arg Lys 
1 5 10 

40 CTG TTT ATC CGT TCC ATG GGT GAA GGT ACA ATA AAT GGC TTA CTG GAT 98 
Leu Phe lie Arg Ser Met Gly Glu Gly Thr lie Asn Gly Leu Leu Asp 
15 20 25 

GAA TTA TTA CAG ACA AGG GTG CTG AAC AAG GAA GAG ATG GAG AAA GTA 146 
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Glu Leu Leu Gin Thr Arg Val Leu Asn Lys Glu Glu Met Glu Lys Val 
30 35 40 

AAA CGT GAA AAT GCT ACA GTT ATG GAT AAG ACC CGA GCT TTG ATT GAC 194 
Lys Arg Glu Asn Ala Thr Val Met Asp Lys Thr Arg Ala Leu lie Asp 
5 45 50 55 

TCC GTT ATT CCG AAA GGG GCA CAG GCA TGC CAA ATT TGC ATC ACA TAC 242 
Ser Val lie Pro Lys Gly Ala Gin Ala Cys Gin lie Cys lie Thr Tyr 
60 65 70 75 

ATT TGT GAA GAA GAC AGT TAC CTG GCA GGG ACG CTG GGA CTC TCA GCA 290 
10 lie Cys Glu Glu Asp Ser Tyr Leu Ala Gly Thr Leu Gly Leu Ser Ala 

80 85 90 

GAT CAA ACA TCT GGA AAT TAC CTT AAT ATG CAA GAC TCT CAA GGA GTA 33 8 

Asp Gin Thr Ser Gly Asn Tyr Leu Asn Met Gin Asp Ser Gin Gly Val 
95 100 105 

15 CTT TCT TCC TTT CCA GCT CCT CAG GCA GTG CAG GAC AAC CCA GCT ATG 386 
Leu Ser Ser Phe Pro Ala Pro Gin Ala Val Gin Asp Asn Pro Ala Met 
110 115 120 

CCC ACA TCC TCA GGC TCA GAA GGG AAT GTC AAG CTT TGC TCC CTA GAA 434 
Pro Thr Ser Ser Gly Ser Glu Gly Asn Val Lys Leu Cys Ser Leu Glu 
20 125 130 135 

GAA GCT CAA AGG ATA TGG AAA CAA AAG TCG GCA GAG ATT TAT CCA ATA 482 
Glu Ala Gin Arg lie Trp Lys Gin Lys Ser Ala Glu lie Tyr Pro lie 
140 145 150 155 

ATG GAC AAG TCA AGC CGC ACA CGT CTT GCT CTC ATT ATC TGC AAT GAA 53 0 

25 Met Asp Lys Ser Ser Arg Thr Arg Leu Ala Leu lie lie Cys Asn Glu 

160 165 170 

GAA TTT GAC AGT ATT CCT AGA AGA ACT GGA GCT GAG GTT GAC ATC ACA 578 
Glu Phe Asp Ser lie Pro Arg Arg Thr Gly Ala Glu Val Asp lie Thr 
175 180 185 

3 0 GGC ATG ACA ATG CTG CTA CAA AAT CTG GGG TAC AGC GTA GAT GTG AAA 62 6 

Gly Met Thr Met Leu Leu Gin Asn Leu Gly Tyr Ser Val Asp Val Lys 
190 195 200 

AAA AAT CTC ACT GCT TCG GAC ATG ACT ACA GAG CTG GAG GCA TTT GCA 674 
Lys Asn Leu Thr Ala Ser Asp Met Thr Thr Glu Leu Glu Ala Phe Ala 
35 205 210 215 

CAC CGC CCA GAG CAC AAG ACC TCT GAC AGC ACG TTC CTG GTG TTC ATG 722 
His Arg Pro Glu His Lys Thr Ser Asp Ser Thr Phe Leu Val Phe Met 
220 225 230 235 
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TCT CAT GGT ATT CGG GAA GGC ATT TGT GGG AAG AAA CAC TCT GAG CAA 770 
Ser His Gly lie Arg Glu Gly lie Cys Gly Lys Lys His Ser Glu Gin 
240 245 250 

GTC CCA GAT ATA CTA CAA CTC AAT GCA ATC TTT AAC ATG TTG AAT ACC 818 
5 Val Pro Asp lie Leu Gin Leu Asn Ala lie Phe Asn Met Leu Asn Thr 
255 260 265 

AAG AAC TGC CCA AGT TTG AAG GAC AAA CCG AAG GTG ATC ATC ATC CAG 866 
Lys Asn Cys Pro Ser Leu Lys Asp Lys Pro Lys Val lie lie lie Gin 
270 275 280 

10 GCC TGC CGT GGT GAC AGC CCT GGT GTG GTG TGG TTT AAA GAT TCA GTA 914 
Ala Cys Arg Gly Asp Ser Pro Gly Val Val Trp Phe Lys Asp Ser Val 
285 290 295 

GGA GTT TCT GGA AAC CTA TCT TTA CCA ACT ACA GAA GAG TTT GAG GAT 962 
Gly Val Ser Gly Asn Leu Ser Leu Pro Thr Thr Glu Glu Phe Glu Asp 
15 300 305 310 315 

GAT GCT ATT AAG AAA GCC CAC ATA GAG AAG GAT TTT ATC GCT TTC TGC 1010 
O Asp Ala lie Lys Lys Ala His He Glu Lys Asp Phe He Ala Phe Cys 

# 320 325 330 

33 TCT TCC ACA CCA GAT AAT GTT TCT TGG AGA CAT CCC ACA ATG GGC TCT 1058 

CEf 20 Ser Ser Thr Pro Asp Asn Val Ser Trp Arg His Pro Thr Met Gly Ser 

fU 335 340 345 

b| GTT TTT ATT GGA AGA CTC ATT GAA CAT ATG CAA GAA TAT GCC TGT TCC 1106 

s" Val Phe He Gly Arg Leu He Glu His Met Gin Glu Tyr Ala Cys Ser 

Q 350 355 360 

m 25 TGT GAT GTG GAG GAA ATT TTC CGC AAG GTT CGA TTT TCA TTT GAG CAG 1154 

?y Cys Asp Val Glu Glu He Phe Arg Lys Val Arg Phe Ser Phe Glu Gin 

p 365 370 375 

CCA GAT GGT AGA GCG CAG ATG CCC ACC ACT GAA AGA GTG ACT TTG ACA 1202 
Pro Asp Gly Arg Ala Gin Met Pro Thr Thr Glu Arg Val Thr Leu Thr 
30 380 385 390 395 

AGA TGT TTC TAC CTC TTC CCA GGA CAT TAAAATAAGG AAACTGTATG AATGTCT 1256 
Arg Cys Phe Tyr Leu Phe Pro Gly His 
400 

GCGGGCAGGA AGTGAAGAGA TCGTTCTGTA AAAGGTTTTT GGAATTATGT CTGCTGAATA 1316 

3 5 ATAAACTTTT TTTGAAATAA TAAATCTGGT AGAAAAATGA AAAAAAAAAA AAAAAAA 13 73 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 404 amino acids 
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# 



10 



15 

S 20 

z^?z 25 



ry 30 



35 



40 



(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Ala Asp Lys Val Leu Lys Glu Lys Arg Lys Leu Phe lie Arg Ser 

15 10 15 

Met Gly Glu Gly Thr lie Asn Gly Leu Xaa Asp Glu Leu Leu Gin Thr 

20 25 30 

Arg Val Leu Asn Lys Glu Glu Met Glu Lys Val Lys Arg Glu Asn Ala 

35 40 45 

Thr Val Met Asp Lys Thr Arg Ala Leu lie Asp Ser Val lie Pro Lys 

50 55 60 

Xaa Ala Gin Ala Cys Gin lie Cys lie Thr Tyr lie Cys Glu Glu Asp 
65 70 75 80 

Ser Tyr Leu Ala Gly Thr Leu Gly Leu Ser Ala Asp Gin Thr Ser Gly 

85 90 95 

Asn Tyr Leu Asn Met Gin Asp Ser Gin Gly Val Leu Ser Ser Phe Pro 

100 105 110 

Ala Pro Gin Ala Val Gin Asp Asn Pro Ala Met Pro Thr Ser Ser Gly 

115 120 125 

Ser Glu Gly Asn Val Lys Leu Cys Ser Leu Glu Glu Ala Gin Arg lie 

130 135 140 

Trp Lys Gin Lys Ser Ala Glu lie Tyr Pro lie Met Asp Lys Ser Ser 
145 150 155 160 

Arg Thr Arg Leu Ala Leu lie lie Cys Asn Glu Glu Phe Asp Ser lie 

165 170 175 

Pro Arg Arg Thr Gly Ala Glu Val Asp lie Thr Gly Met Thr Met Leu 

180 185 190 

Leu Gin Asn Leu Gly Tyr Ser Val Asp Val Lys Lys Asn Leu Thr Ala 

195 200 205 

Ser Asp Met Thr Thr Glu Leu Glu Ala Phe Ala His Arg Pro Glu His 

210 215 220 

Lys Thr Ser Asp Ser Thr Phe Leu Val Phe Met Ser His Gly lie Arg 
225 230 235 240 

Glu Gly lie Cys Gly Lys Lys His Ser Glu Gin Val Pro Asp lie Leu 

245 250 255 

Gin Leu Asn Ala lie Phe Asn Met Leu Asn Thr Lys Asn Cys Pro Ser 

260 265 270 

Leu Lys Asp Lys Pro Lys Val lie lie lie Gin Ala Xaa Arg Xaa Asp 

275 280 * 285 

Ser Pro Gly Val Val Trp Phe Lys Asp Ser Val Gly Val Ser Gly Asn 

290 295 300 

Leu Ser Leu Pro Thr Thr Glu Glu Phe Glu Asp Asp Ala lie Lys Lys 
305 310 315 320 

Ala His lie Xaa Lys Asp Phe lie Ala Phe Cys Ser Ser Thr Pro Asp 



325 



330 



335 
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# 



10 



Asn Val Ser Xaa Arg His Pro Thr Met Gly Ser Val Phe He Gly Arg 

340 345 350 

Leu He Glu His Met Gin Glu Tyr Xaa Cys Ser Cys Asp Val Glu Glu 

355 360 365 

He Phe Arg Lys Val Arg Phe Ser Phe Glu Gin Pro Asp Gly Arg Ala 

370 375 380 

Gin Met Pro Thr Thr Xaa Arg Val Xaa Leu Thr Arg Cys Phe Tyr Leu 
385 390 395 400 

Phe Pro Gly His 



(2) INFORMATION FOR SEQ ID NO:5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 amino acids 

(B) TYPE: amino acid 

15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE! : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Met Met Arg Gin Asp Arg Trp Leu Leu Glu Arg Asn He Leu Glu Phe 
20 1 5 10 15 

Ser Ser Lys Leu Gin Ala Asp Leu He Leu Asp Val Leu He Ala Lys 

20 25 30 

Gin Val Leu Asn Ser Asp Asn Gly Asp Val He Asn Ser Cys Arg Thr 
35 40 45 

25 Glu Arg Asp Asn Glu Lys Glu He Val Lys Ala Val Gin Arg Arg Gly 
50 55 60 

Asp Glu Ala Phe Asp Ala Phe Tyr Asp Ala Leu Arg Asp Thr Gly His 
65 70 75 80 

Asn Asp Leu Ala Asp Val Leu Met Pro Leu Ser Arg Pro Xaa Xaa Xaa 
30 85 90 95 

Asn Pro Val Pro Met Glu Cys Pro Met Ser Pro Ser Ser His Arg Arg 

100 105 110 

Ser Arg Ala Leu Ser Pro Pro Xaa Tyr Ala Ser Pro Thr Arg Val His 
115 120 125 

35 Arg Asp Ser He Ser Ser Val Ser Ser Phe Thr Ser Thr Tyr Gin Asp 
130 135 140 

Val Tyr Ser Arg Ala Arg Ser Ser Ser Arg Ser Ser Arg Pro Leu Gin 
145 150 155 160 

Ser Ser Asp Arg His Asn Tyr Met Ser Ala Ala Thr Ser Phe Pro Ser 
40 165 170 * 175 

Gin Pro Xaa Ser Ala Asn Ser Ser Phe Thr Gly Cys Ala Ser Leu Gly 

180 185 190 

Tyr Ser Ser Ser Arg Asn Arg Ser Phe Ser Lys Thr Ser Ala Gin Ser 
195 200 205 

45 Gin Tyr He Phe His Glu Glu Asp Met Asn Tyr Val Asp Ala Pro Thr 
210 215 220 
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He His Arg Val Phe Asp Glu Lys Thr Met Tyr Arg Asn Phe Ser Ser 
225 230 235 240 

Pro Arg Gly Leu Cys Leu He He Asn Asn Glu His Phe Glu Gin Met 
245 250 255 

5 Pro Thr Arg Asn Gly Thr Lys Ala Asp Lys Asp Asn Leu Thr Asn He 
260 265 270 

Phe Arg Cys Met Gly Tyr Thr Val He Cys Lys Asp Asn Leu Thr Gly 

275 280 285 

Arg Glu Met Leu Ser Thr He Arg Ser Phe Gly Arg Asn Asp Met His 
10 290 295 300 

Gly Asp Ser Ala He Leu Val He Leu Ser His Gly Glu Xaa Asn Val 
305 310 315 320 

He He Gly Val Asp Asp Val Ser Val Asn Val His Glu He Tyr Asp 
325 330 335 

15 Leu Leu Asn Ala Ala Asn Ala Pro Arg Leu Ala Asn Lys Pro Lys Leu 

340 345 350 

Val Phe Val Gin Ala Cys Arg Gly Glu Arg Arg Asp Asn Gly Phe Pro 

355 360 365 

Val Leu Asp Ser Val Asp Gly Val Pro Ser Leu He Arg Arg Gly Trp 
20 370 375 380 

Asp Asn Arg Asp Gly Pro Leu Phe Asn Phe Leu Gly Cys Val Arg Pro 
O 385 390 395 400 

yif Gin Val Gin Gin Val Trp Arg Lys Lys Pro Ser Gin Ala Asp Met Leu 

m 405 410 415 

fjj 25 He Ala Tyr Ala Thr Thr Ala Gin Tyr Val Ser Trp Arg Asn Ser Ala 
83 420 425 430 

ffj Arg Gly Ser Trp Phe He Gin Ala Val Cys Glu Val Phe Ser Leu His 

jj 435 440 445 

yi Ala Lys Asp Met Asp Val Val Glu Leu Leu Thr Glu Val Asn Lys Lys 

I"" 30 450 455 460 

Val Ala Cys Gly Phe Gin Thr Ser Gin Gly Ser Asn He Leu Lys Gin 
S 465 470 475 480 

5 I Met Pro Glu Leu Thr Ser Arg Leu Leu Lys Lys Phe Tyr Phe Trp Pro 

L?5 485 490 495 

35 Glu Asp Arg Gly Arg Asn Ser Ala Val 
!f 500 505 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 479 amino acids 

40 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

45 Thr Val Ser Leu Ser Leu He He Ala Arg Gin Val Leu Asn Ser Asp 
15 10 15 
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Asn Gly Asp Met He Asn Ser Cys Arg Thr Glu Arg Asp Asn Glu Lys 

20 25 30 

Glu He Val Lys Ala Val Gin Arg Arg Gly Asp Glu Ala Phe Asp Ala 

35 40 45 

Phe Tyr Asp Ala Leu Arg Asp Thr Gly His Asn Asp Leu Ala Asp Val 

50 55 60 

Leu Met Pro Leu Ser Arg Pro Val Asp Ser Asn Pro Val Pro Met Glu 
65 70 75 80 

Cys Pro Met Ser Pro Ser Ser His Arg Arg Ser Arg Ala Leu Ser Pro 

85 90 95 

Pro Xaa Tyr Ala Ser Pro Thr Arg Val His Arg Asp Ser He Ser Ser 

100 105 HO 

Val Ser Ser Phe Thr Ser Thr Tyr Gin Asp Val Tyr Ser Arg Ala Thr 

115 120 125 

Ser Ser Ser Pro Leu Gin Thr Ser Asp Arg His Asn Tyr Val Ser Ala 

130 135 140 

Ser Thr Ser Phe Gin Ser Gin Pro Ala Ser Ala Asn Ser Ser Phe Thr 
145 150 155 160 

Gly Ser Ala Ser Leu Gly Tyr Ser Ser Ser Arg Thr Arg Ser Tyr Ser 

165 170 175 

Lys Thr Ser Ala His Ser Gin Tyr He Phe His Glu Glu Asp Met Asn 

180 185 190 

Tyr Val Asp Ala Pro Thr He His Arg Val Phe Asp Glu Lys Thr Met 

195 200 205 

Tyr Arg Asn Phe Ser Thr Pro Arg Gly Leu Cys Leu He He Asn Asn 

210 215 220 

Glu His Phe Glu Gin Met Pro Thr Arg Asn Gly Thr Lys Pro Asp Lys 
225 230 235 240 

Asp Asn He Ser Asn He Phe Arg Cys Met Gly Tyr He Val His Cys 

245 250 255 

Lys Asp Asn Leu Thr Gly Arg Glu Met Met Ser Thr He Arg Ser Phe 

260 265 270 

Gly Arg Asn Asp Thr His Gly Asp Ser Ala He Leu Val He Leu Ser 

275 280 285 

His Gly Glu Xaa Asn Val He He Gly Val Asp Asp Val Ser Val Asn 

290 295 300 

Val His Glu He Tyr Xaa Leu Leu Asn Ala Ala Asn Ala Pro Arg Leu 
305 310 315 320 

Ala Asn Lys Pro Lys Leu Val Phe Val Gin Ala Cys Arg Gly Glu Arg 

325 330 335 

Arg Asp Val Gly Phe Pro Val Leu Asp Ser Val Asp Gly Val Pro Ser 

340 345 350 

Leu He Arg Arg Gly Trp Asp Lys Gly Asp Gly Pro Leu Phe Asn Phe 

355 360 365 

Leu Gly Cys Val Arg Pro Gin Ala Gin Gin Val Trp Arg Lys Lys Pro 

370 375 380 

Ser Gin Ala Asp Met Leu He Ala Tyr Ala Thr Thr Ala Gin Tyr Val 
385 390 395 400 

Ser Trp Arg Asn Ser Ala Arg Gly Ser Trp Phe He Gin Ala Val Cys 

405 410 415 

Glu Val Phe Ser Leu His Ala Lys Asp Met Asp Val Val Glu Leu Leu 
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420 425 430 

Thr Glu Val Asn Lys Lys Val Ala Cys Gly Phe Gin Thr Ser Gin Gly 

435 440 445 

Ala Asn He Leu Lys Gin Met Pro Glu Leu Thr Ser Arg Leu Leu Lys 

450 455 460 

Lys Phe Tyr Phe Trp Pro Glu Asp Arg Asn Arg Ser Ser Ala Val 
465 470 475 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
TCATCGACTT TTAGATGACT AGAGAACATC 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
GTTGCACTGC TTTCACGATC TCCCGTCTCT 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

GTTTAATTAC CCAAGTTTGA G 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GGTTTTAACC AGTTACTCAA G 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CCGGTGACAT TGGACACTC 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 
ACTATTCAAC ACTTG 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa at position 117 is Ala or Val 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Leu Thr Val Gin Val Tyr Arg Thr Ser Gin Lys Cys Ser Ser Ser 
15 10 15 
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Lys His Val Val Glu Val Leu Leu Asp Pro Leu Gly Thr Ser Phe Cys 

20 25 30 

Ser Leu Leu Pro Pro Pro Leu Leu Leu Tyr Glu Thr Asp Arg Gly Val 
35 40 45 

5 Asp Gin Gin Asp Gly Lys Asn His Thr Gin Ser Pro Gly Cys Glu Glu 
50 55 60 

Ser Asp Ala Gly Lys Glu Glu Leu Met Lys Met Arg Leu Pro Thr Arg 
65 70 75 80 

Ser Asp Met He Cys Gly Tyr Ala Cys Leu Lys Gly Asn Ala Ala Met 
10 85 90 95 

Arg Asn Thr Lys Arg Gly Ser Trp Tyr He Glu Ala Leu Thr Gin Val 

100 105 110 

Phe Ser Glu Arg Xaa Cys Asp Met His Val Ala Asp Met Leu Val Lys 
115 120 125 

15 Val Asn Ala Leu He Lys Glu Arg Glu Gly Tyr Ala Pro Gly Thr Glu 
130 135 140 

Phe His Arg Cys Lys Glu Met Ser Glu Tyr Cys Ser Thr Leu Cys Gin 
145 150 155 160 

Gin Leu Tyr Leu Phe Pro Gly Tyr Pro Pro Thr 
20 165 170 

(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 amino acids 

(B) TYPE: amino acid 

25 (C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14; 

Met Ala Asp Lys He Leu Arg Ala Lys Arg Lys Gin Phe He Asn Ser 
30 1 5 10 15 

Val Ser He Gly Thr He Asn Gly Leu Leu Asp Glu Leu Leu Glu Lys 

20 25 30 

Arg Val Leu Asn Gin Glu Glu Met Asp Lys He Lys Leu Ala Asn He 
35 40 45 

3 5 Thr Ala Met Asp Lys Ala Arg Asp Leu Cys Asp His Val Ser Lys Lys 
50 55 60 

Gly Pro Gin Ala Ser Gin He Phe He Thr Tyr He Cys Asn Glu Asp 
65 70 75 80 

Cys Tyr Leu Ala Gly He Leu Glu Leu Gin Ser Ala Pro Ser Ala Glu 
40 85 90 - 95 

Thr Phe Val Ala Thr Glu Asp Ser Lys Gly Gly His Pro Ser Ser Ser 

100 105 110 

Glu Thr Lys Glu Glu Gin Asn Lys Glu Asp Gly Thr Phe Pro Gly Leu 
115 120 125 

45 Thr Gly Thr Leu Lys Phe Cys Pro Leu Glu Lys Ala Gin Lys Leu Trp 
130 135 140 
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Lys Glu Asn Pro Ser Glu lie Tyr Pro lie Met Asn Thr Thr Thr Arg 
145 150 155 160 

Thr Arg Leu Ala Leu lie lie Cys Asn Thr Glu Phe Gin His Leu Ser 
165 170 175 

5 Pro Arg Val Gly Ala Gin Val Asp Leu Arg Glu Met Lys Leu Leu Leu 
180 185 190 

Glu Asp Leu Gly Tyr Thr Val Lys Val Lys Glu Asn Leu Thr Ala Leu 

195 200 205 

Glu Met Val Lys Glu Val Lys Glu Phe Ala Ala Cys Pro Glu His Lys 
10 210 215 220 

Thr Ser Asp Ser Thr Phe Leu Val Phe Met Ser His Gly lie Gin Glu 
225 230 235 240 

Gly lie Cys Gly Thr Thr Tyr Ser Asn Glu Val Ser Asp lie Leu Lys 
245 250 255 

15 Val Asp Thr lie Phe Gin Met Met Asn Thr Leu Lys Cys Pro Ser Leu 

260 265 270 

Lys Asp Lys Pro Lys Val lie lie lie Gin Ala Cys Arg Gly Glu Lys 

275 280 285 

Gin Gly Val Val Leu Leu Lys Asp Ser Val Arg Asp Ser Glu Glu Asp 
20 290 295 300 

p% Phe Leu Thr Asp Ala lie Phe Glu Asp Asp Gly He Lys Lys Ala His 

% 305 310 315 320 

5 He Glu Lys Asp Phe He Ala Phe Cys Ser Ser Thr Pro Asp Asn Val 

SJf 325 330 335 

25 Ser Trp Arg His Pro Val Arg Gly Ser Leu Phe He Glu Ser Leu He 
5: 340 345 350 

; J Lys His Met Lys Glu Tyr Ala Trp Ser Cys Asp Leu Glu Asp He Phe 

* 355 360 365 

m Arg Lys Val Arg Phe Ser Phe Glu Gin Pro Glu Phe Arg Leu Gin Met 

1^ 30 370 375 380 

M Pro Thr Ala Asp Arg Val Thr Leu Thr Lys Arg Phe Tyr Leu Phe Pro 

5;- 385 390 395 400 

Hj Gly His 



M* 35 (2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa at position 3 is Ala, His, Gin, 
Lys, Phe, Cha or Asp. 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Tyr Val Xaa Asp 
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(2) INFORMATION FOR SEQ ID NO: 16: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 
5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

10 (D) OTHER INFORMATION: Peptide has an acetyl group at the 

amino terminus and an aldehyde group at the carboxyl terminus. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Tyr Val Ala Asp 
1 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 17 : 

Tyr Val Ala Asp 
1 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
3 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa at position 2 is D-Ala. Peptide 
has an acetyl group at the amino terminus and an aldehyde group at 

35 the carboxyl terminus. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
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Tyr Xaa Ala Asp 
1 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
10 (ix) FEATURE: 

(D) OTHER INFORMATION: Peptide has an acetyl group at the 
amino terminus and an aldehyde group at the carboxyl terminus. 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Tyr Val Lys Asp 
15 1 

O (2) INFORMATION FOR SEQ ID NO: 20: 

B (i) SEQUENCE CHARACTERISTICS: 

03 (A) LENGTH: 354 amino acids 

03 (B) TYPE: amino acid 

fij 2 0 (C) STRANDEDNESS: single 

*p (D) TOPOLOGY: linear 

s (ii) MOLECULE TYPE: protein 

Q (v) FRAGMENT TYPE: C-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



25 


Arg 
1 


Ser 


Arg 


Ser 


Arg 
5 


Ser 


Arg 


Ala 


Leu 


His 
10 


Ser 


Ser Asp Arg 


His 
15 


Asn 




Tyr 


Ser 


Ser 


Pro 


Pro 


Val 


Asn 


Ala 


Phe 


Pro 


Ser 


Gin Pro Ser 


Ser 


Ala 








20 










25 






30 








Asn 


Ser 


Ser 


Phe 


Thr 


Gly 


Cys 


Ser 


Ser 


Leu 


Gly 


Tyr Ser Ser 


Ser 


Arg 


30 






35 










40 








45 








Asn Arg 


Ser 


Phe 


Ser 


Lys 


Ala 


Ser 


Gly 


Pro 


Thr 


Gin Tyr He 


Phe 


His 






50 










55 










60 








Glu 


Glu 


Asp 


Met 


Asn 


Phe 


Val 


Asp 


Ala 


Pro 


Thr 


He Ser Arg 


Val 


Phe 




65 










70 










75 






80 


35 


Asp 


Glu 


Lys 


Thr 


Met 
85 


Tyr 


Arg 


Asn 


Phe 


Ser 
90 


Ser 


Pro Arg Gly 


Met 
95 


Cys 




Leu 


He 


He 


Asn 
100 


Asn 


Glu 


His 


Phe 


Glu 
105 


Gin 


Met 


Pro Thr Arg 
110 


Asn 


Gly 




Thr 


Lys 


Ala 


Asp 


Lys 


Asp 


Asn 


Leu 


Thr 


Asn 


Leu 


Phe Arg Cys 


Met 


Gly 


40 






115 










120 








125 








Tyr 


Thr 


Val 


He 


Cys 


Lys 


Asp 


Asn 


Leu 


Thr 


Gly 


Arg Gly Met 


Leu 


Leu 
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130 135 140 

Thr lie Arg Asp Phe Ala Lys His Glu Ser His Gly Asp Ser Ala lie 
145 150 155 160 

Leu Val lie Leu Ser His Gly Glu Glu Asn Val lie lie Gly Val Asp 
5 165 170 175 

Asp lie Pro lie Ser Thr His Glu lie Tyr Asp Leu Leu Asn Ala Ala 

180 185 190 

Asn Ala Pro Arg Leu Ala Asn Lys Pro Lys lie Val Phe Val Gin Ala 
195 200 205 

10 Cys Arg Xaa Glu Arg Arg Asp Asn Gly Phe Pro Val Leu Asp Ser Val 
210 215 220 

Asp Gly Val Pro Ala Phe Leu Arg Arg Gly Trp Asp Asn Arg Asp Gly 
225 230 235 240 

Pro Leu Phe Asn Phe Leu Gly Cys Val Arg Pro Gin Val Xaa Gin Val 
15 245 250 255 

Trp Arg Lys Lys Pro Ser Xaa Ala Asp lie Leu lie Arg Tyr Ala Thr 

260 265 270 

Thr Ala Gin Tyr Val Ser Xaa Arg Asn Ser Ala Arg Gly Ser Trp Phe 
275 280 285 

20 lie Gin Ala Val Cys Glu Val Phe Ser Thr His Xaa Lys Asp Met Asp 
n| 290 295 300 

% Val Val Glu Leu Leu Thr Glu Val Asn Lys Lys Val Xaa Cys Gly Phe 

5j 305 310 315 320 

S Gin Thr Ser Gin Gly Ser Asn lie Leu Lys Gin Met Pro Xaa Met Thr 

S 25 325 330 335 

Xaa Arg Leu Leu Lys Lys Phe Tyr Phe Trp Pro Glu Ala Arg Asn Ser 
£ 340 345 350 

K Ala Val 



2? 30 (2) INFORMATION FOR SEQ ID NO: 21: 

= 

[U (i) SEQUENCE CHARACTERISTICS: 

W (A) LENGTH: 131 amino acids 

y (B) TYPE: amino acid 

H= (C) STRANDEDNESS : single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: C- terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Val Asp Gly Val Pro Ala Phe Leu Arg Arg Gly Trp Asp Asn Arg Asp 
40 1 5 10 * 15 

Gly Pro Leu Phe Asn Phe Leu Gly Cys Val Arg Pro Gin Val Xaa Gin 

20 25 30 

Val Trp Arg Lys Lys Pro Ser Xaa Ala Asp He Leu He Arg Tyr Ala 
35 40 45 

45 Thr Thr Ala Gin Tyr Val Ser Xaa Arg Asn Ser Ala Arg Gly Ser Trp 
50 55 60 
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Phe He Gin Ala Val Cys Glu Val Phe Ser Thr His Xaa Lys Asp Met 
65 70 75 80 

Asp Val Val Glu Leu Leu Thr Glu Val Asn Lys Lys Val Xaa Cys Gly 
85 90 95 

5 Phe Gin Thr Ser Gin Gly Ser Asn He Leu Lys Gin Met Pro Xaa Met 
100 105 110 

Thr Xaa Arg Leu Leu Lys Lys Phe Tyr Phe Trp Pro Glu Ala Arg Asn 

115 120 125 

Ser Ala Val 
10 130 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 amino acids 

(B) TYPE: amino acid 

15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

20 Arg Ser Arg Ser Arg Ser Arg Ala Leu His Ser Ser Asp Arg His Asn 
1 5 10 15 

Tyr Ser Ser Pro Pro Val Asn Ala Phe Pro Ser Gin Pro Ser Ser Ala 

20 25 30 

Asn Ser Ser Phe Thr Gly Cys Ser Ser Leu Gly Tyr Ser Ser Ser Arg 
25 35 40 45 

Asn Arg Ser Phe Ser Lys Ala Ser Gly Pro Thr Gin Tyr lie Phe His 

50 55 60 

Glu Glu Asp Met Asn Phe Val Asp Ala Pro Thr lie Ser Arg Val Phe 
65 70 75 80 

30 Asp Glu Lys Thr Met Tyr Arg Asn Phe Ser Ser Pro Arg Gly Met Cys 

85 90 95 

Leu He He Asn Asn Glu His Phe Glu Gin Met Pro Thr Arg Asn Gly 

100 105 110 

Thr Lys Ala Asp Lys Asp Asn Leu Thr Asn Leu Phe Arg Cys Met Gly 
35 115 120 125 

Tyr Thr Val He Cys Lys Asp Asn Leu Thr Gly Arg Gly Met Leu Leu 

130 135 140 

Thr He Arg Asp Phe Ala Lys His Glu Ser His Gly Asp Ser Ala He 
145 150 155 160 

40 Leu Val He Leu Ser His Gly Glu Glu Asn Val He He Gly Val Asp 

165 170 175 

Asp He Pro He Ser Thr His Glu He Tyr Asp Leu Leu Asn Ala Ala 

180 185 190 

Asn Ala Pro Arg Leu Ala Asn Lys Pro Lys He Val Phe Val Gin Ala 
45 195 200 205 

Cys Arg Xaa Glu Arg Arg Asp Asn Gly Phe Pro Val Leu Asp Ser 
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210 215 220 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 294 amino acids 

5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Phe Pro Ala Pro Gin Ala Val Gin Asp Asn Pro Ala Met Pro Thr Ser 

15 10 15 

Ser Gly Ser Glu Gly Asn Val Lys Leu Cys Ser Leu Glu Glu Ala Gin 
20 25 30 

15 Arg He Trp Lys Gin Lys Ser Ala Glu He Tyr Pro He Met Asp Lys 
35 40 45 

- Ser Ser Arg Thr Arg Leu Ala Leu He He Cys Asn Glu Glu Phe Asp 
50 55 60 

Ser He Pro Arg Arg Thr Gly Ala Glu Val Asp He Thr Gly Met Thr 
20 65 70 75 80 

Met Leu Leu Gin Asn Leu Gly Tyr Ser Val Asp Val Lys Lys Asn Leu 

85 90 95 

Thr Ala Ser Asp Met Thr Thr Glu Leu Glu Ala Phe Ala His Arg Pro 
100 105 HO 

25 Glu His Lys Thr Ser Asp Ser Thr Phe Leu Val Phe Met Ser His Gly 
115 120 125 

He Arg Glu Gly He Cys Gly Lys Lys His Ser Glu Gin Val Pro Asp 

130 135 140 

He Leu Gin Leu Asn Ala He Phe Asn Met Leu Asn Thr Lys Asn Cys 
30 145 150 155 160 

Pro Ser Leu Lys Asp Lys Pro Lys Val He He He Gin Ala Xaa Arg 

165 170 175 

Xaa Asp Ser Pro Gly Val Val Trp Phe Lys Asp Ser Val Gly Val Ser 
180 185 190 

3 5 Gly Asn Leu Ser Leu Pro Thr Thr Glu Glu Phe Glu Asp Asp Ala He 
195 200 205 

Lys Lys Ala His He Xaa Lys Asp Phe He Ala Phe Cys Ser Ser Thr 

210 215 220 

Pro Asp Asn Val Ser Xaa Arg His Pro Thr Met Gly Ser Val Phe He 
40 225 230 2^35 240 

Gly Arg Leu He Glu His Met Gin Glu Tyr Xaa Cys Ser Cys Asp Val 

245 250 255 

Glu Glu He Phe Arg Lys Val Arg Phe Ser Phe Glu Gin Pro Asp Gly 
260 265 270 

45 Arg Ala Gin Met Pro Thr Thr Xaa Arg Val Xaa Leu Thr Arg Cys Phe 
275 280 285 
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Tyr Leu Phe Pro Gly His 
290 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
10 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



15 



20 



Ser Val 


Gly Val 


Ser 


Gly 


Asn 


Leu 


Ser 


Leu 


Pro 


Thr 


Thr 


Glu 


Glu 


Phe 


1 






5 










10 










15 




Glu Asp 


Asp 


Ala 


He 


Lys 


Lys 


Ala 


His 


He 


Xaa 


Lys 


Asp 


Phe 


He 


Ala 






20 










25 










30 






Phe Cys 


Ser 


Ser 


Thr 


Pro 


Asp 


Asn 


Val 


Ser 


Xaa 


Arg 


His 


Pro 


Thr 


Met 




35 










40 










45 








Gly Ser 


Val 


Phe 


He 


Gly 


Arg 


Leu 


He 


Glu 


His 


Met 


Gin 


Glu 


Tyr 


Xaa 


50 










55 










60 










Cys Ser 


Cys 


Asp 


Val 


Glu 


Glu 


He 


Phe 


Arg 


Lys 


Val 


Arg 


Phe 


Ser 


Phe 


65 








70 










75 










80 


Glu Gin 


Pro 


Asp 


Gly 


Arg 


Ala 


Gin 


Met 


Pro 


Thr 


Thr 


Xaa 


Arg 


Val 


Xaa 








85 










90 










95 




Leu Thr 


Arg 


Cys 


Phe 


Tyr 


Leu 


Phe 


Pro 


Gly 


His 













25 100 105 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187 amino acids 

(B) TYPE: amino acid 

3 0 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

35 Phe Pro Ala Pro Gin Ala Val Gin Asp Asn Pro Ala Met Pro Thr Ser 
15 10 15 

Ser Gly Ser Glu Gly Asn Val Lys Leu Cys Ser Leu Glu Glu Ala Gin 

20 25 30 

Arg He Trp Lys Gin Lys Ser Ala Glu He Tyr Pro He Met Asp Lys 
40 35 40 45 

Ser Ser Arg Thr Arg Leu Ala Leu He He Cys Asn Glu Glu Phe Asp 
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50 55 60 

Ser lie Pro Arg Arg Thr Gly Ala Glu Val Asp lie Thr Gly Met Thr 
65 70 75 80 

Met Leu Leu Gin Asn Leu Gly Tyr Ser Val Asp Val Lys Lys Asn Leu 
5 85 90 95 

Thr Ala Ser Asp Met Thr Thr Glu Leu Glu Ala Phe Ala His Arg Pro 

100 105 110 

Glu His Lys Thr Ser Asp Ser Thr Phe Leu Val Phe Met Ser His Gly 
115 120 125 

10 lie Arg Glu Gly lie Cys Gly Lys Lys His Ser Glu Gin Val Pro Asp 
130 135 140 

lie Leu Gin Leu Asn Ala lie Phe Asn Met Leu Asn Thr Lys Asn Cys 
145 150 155 160 

Pro Ser Leu Lys Asp Lys Pro Lys Val lie lie lie Gin Ala Xaa Arg 
15 165 170 175 

Xaa Asp Ser Pro Gly Val Val Trp Phe Lys Asp 
180 185 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 172 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
25 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Leu Thr Val Gin Val Tyr Arg Thr Ser Gin Lys Cys Ser Ser Ser 

15 10 15 

Lys His Val Val Glu Val Leu Leu Asp Pro Leu Gly Thr Ser Phe Cys 
30 20 ' 25 30 

Ser Leu Leu Pro Pro Pro Leu Leu Leu Tyr Glu Thr Asp Arg Gly Val 

35 40 45 

Asp Gin Gin Asp Gly Lys Asn His Thr Gin Ser Pro Gly Cys Glu Glu 
50 55 60 

3 5 Ser Asp Ala Gly Lys Glu Glu Leu Met Lys Met Arg Leu Pro Thr Arg 
65 70 75 80 

Ser Asp Met lie Cys Gly Tyr Ala Cys Leu Lys Gly Asn Ala Ala Met 

85 90 95 

Arg Asn Thr Lys Arg Gly Ser Trp Tyr lie Glu Ala Leu Thr Gin Val 
40 100 105 110 

Phe Ser Glu Arg Ala Xaa Cys Asp Met His Val Ala Asp Met Leu Val 

115 120 125 

Lys Val Asn Ala Leu lie Lys Glu Arg Glu Gly Tyr Ala Pro Gly Thr 
130 135 140 

45 Glu Phe His Arg Cys Lys Glu Met Ser Glu Tyr Cys Ser Thr Leu Cys 
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145 150 155 160 

Gin Gin Leu Tyr Leu Phe Pro Gly Tyr Pro Pro Thr 
165 170 

(2) INFORMATION FOR SEQ ID NO: 27: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

Met Leu Thr Val Gin Val Tyr Arg Thr Ser Gin Lys Cys Ser Ser Ser 
15 10 15 

15 Lys His Val Val Glu Val Leu Leu Asp Pro Leu Gly Thr Ser Phe Cys 

« 20 25 30 

W Ser Leu Leu Pro Pro Pro Leu Leu Leu Tyr Glu Thr Asp Arg Gly Val 

35 40 45 

W Asp Gin Gin Asp Gly Lys Asn His Thr Gin Ser Pro Gly Cys Glu Glu 

OS 20 50 55 60 

W Ser Asp Ala Gly Lys Glu Glu Leu Met Lys Met Arg Leu Pro Thr Arg 

FU 65 70 75 80 

4S Ser Asp Met lie Cys Gly Tyr Ala Cys Leu Lys Gly Asn Ala Ala Met 

W 85 90 95 

s 25 Arg Asn Thr Lys Arg Gly Ser Trp Tyr lie Glu Ala Leu Thr Gin Val 

0 100 105 110 

01 Phe Ser Glu Arg Val Xaa Cys Asp Met His Val Ala Asp Met Leu Val 
ftj 115 120 125 

fy Lys Val Asn Ala Leu lie Lys Glu Arg Glu Gly Tyr Ala Pro Gly Thr 

O 30 130 135 140 

u, Glu Phe His Arg Cys Lys Glu Met Ser Glu Tyr Cys Ser Thr Leu Cys 

145 150 155 160 

Gin Gin Leu Tyr Leu Phe Pro Gly Tyr Pro Pro Thr 
165 170 
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